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Abstract 



A recent problem in community ecology lies in defining structures 
behind matrices of species interactions. The interest in this area is to 
quantify the nestedness degree of the matrix after its maximal packing. 
In this work we evaluate nestedness using the sum of all distances 
of the occupied sites to the vertex of the matrix. We calculate the 
distance for two artificial matrices with the same size and occupancy: a 
random matrix and a perfect nested one. Using these two benchmarks 
we develop a nestedness estimator. The estimator is applied to a set 
of 23 real networks of insect-plant interactions. 

Keywords: nestedness, networks, insect-plant interactions 

1 Introduction 

Networks have been widely used to describe systems in a multitude of disci- 
plines, such as genetic networks, protein networks or the Internet. In ecology, 
networks are mainly used to visualize and describe food webs. But not only 
trophic interactions are the focus of attention. In the last years researchers 
show a growing interest in the study of other species interactions such as para- 
sitism (Vzquez et al., 2005), scavenger species (Selva and Fortuna, 2007) and 
mostly mutualism (Bascompte and Jordano, 2007 and references therein). 
Studies on mutualistic food webs focus on specific pairwise interactions be- 
tween a plant and an animal and how they are shaped by a community con- 
text, either in a single locality, or geographically (Bascompte and Jordano, 
2007). 

Pairwise interactions can be described in the form of a bipartite graph or 
an interaction matrix. These webs arc characterized by nodes that represent 
species or species groups and observed interactions are drawn as links that, 
when not binary, can render their intensity or frequency in graded thick- 
nesses. In the interaction matrix, links are represented as nonzero cells on 
the intersection of a row and column. According to Almeida- Neto et al.,2007. 
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bipartite webs do in fact offer several advantages of their own: first, tfiey are 
often fully resolved, without the problems of uneven resolution which haunt 
the analysis of complete webs. Second, all links are of a single kind of eco- 
logical interaction (e.g. mutualism), which ensures structural integrity as 
well as similar ecological and evolutionary processes throughout the entire 
assemblage. 

The most studied structure within a bipartite graph is the nested pattern 
of species interactions, although other structures are also possible (Prado et 
al. 2006; Almeida-Neto et al., 2007). In nested assemblages, plants with few 
interactions are related only with generalist animals; conversely, specialized 
animals are found related to plants with many links, that is, with large 
associated faunas. Moreover, generalists in one species set tend to interact 
with generalists in the other, forming a dense core of interactions (Prado et 
al., 2006; Bascompte and Jordano, 2007). 

A nested structure is very cohesive and stable. The fact that few species 
are involved in many interactions (functional redundancy), poses the commu- 
nity with the possibility for alternative routes if some interactions disappear 
(Bascompte and Jordano, 2007). A nested structure is also quite robust: 
it is less prone to sampling bias than number of species and links (Nielsen 
and Bascompte, 2007) and not generated by the random combination of sets 
of plants and animals solely in proportion to their different abundances as 
previously thought (Prado et al., 2006). 

Recently, a large series of mutualistic interaction assemblages have shown 
a significantly nested structure (Bascompte et al. 2003). Substantial ef- 
fort has been done in developing various measures and forms of calculating 
nestedness (Atmar and Patterson, 1993; Guimaraes and Guimaraes, 2006; 
Rodrguez-Girons and Santamara, 2006; Almeida-Neto et al., submitted). 
The most commonly used nestedness metric is the Nestedness Temperature 
Calculator, or Nestcalc, used to calculate nestedness in binary matrices (At- 
mar and Patterson, 1993). The nestedness from this algorithm has a problem, 
though: the absolute value of the nestedness temperature is dependent on 
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matrix size and fill. Some studies show that the nestedness temperature of 
randomly assembled matrices increases with network size and attains its max- 
imum value for intermediate fills (Rodrguez-Girons and Santamara, 2006; 
Almeida- Neto et al., submitted). So, smaller networks need lower tempera- 
ture than larger ones do to be significantly nested (Nielsen and Bascompte, 
2007). 

From a mathematical point of view, the central object in the discussion 
about nestedness is a matrix of zeros and ones. The ecologist in the field 
interpret this matrix as a table where she (he) marks a cross at the i column 
and j row each time a species of group one i (e.g. plant) is related to group 
two j(e.g. insect). In order to visualize nestedness in the studied ecological 
community the ecologist has to rank rows and columns of the table. In fact, 
each time one row (or cohimn) is permuted to another row (or to column), 
the interactions among species of groups one and two do not change. For a 
matrix of Li species in group one and L2 species in group 2, there are LiX L2 
possibilities to represent the matrix following different permutations of rows 
and columns. Each one of these possibilities is just a different visualization 
of the same network structure. 

Ranking rows and columns is a very practical option to visualize nest- 
edness in a interaction matrix. When we rank the elements of a matrix we 
choose one of the Li x L2 possibihtes, that one where the elements of the 
matrix are the most packed. In other words, we choose the representations 
where the elements of the matrix are as closest as possible from the i — 1, 
j — 1 corner. In the literature . Packing procedure is a previous step before 
the evaluation of a nestedness index (Atmar and Patterson, 1993). In this 
article we also pack the matrix before the evaluation of our nestedness index. 

We introduce here a new nestedness measure applied to digraphs origi- 
nated from ecological data. However, the method is more general than its 
predecessor and can be naturally applied to graphs (networks) other than 
digraphs. In section 2 we describe the formal objects used in this article: 
adjacency matrix, Manhattan distance in a matrix, projection of a generic 
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matrix into the unit square lattice, packing process, random matrix, max- 
imum nested matrix and nestedness estimator. In section 3 we apply the 
nestedness estimator to a set of insect-plant herbivory networks extracted 
from the community ecology literature. In section 4 we summarize the arti- 
cle, point out potential applications of the method and give the final words. 

2 A measure of nestedness 

In this section we introduce the concept of distance in a matrix to characterize 
the nestedness of digraphs. In order to fix the notation we call digraph an 
object D formed by two sets of vertices Di and D2 and a set of hnks between 
these two sets. The digraph is completely described by the adjacency matrix, 
M , of size Li x L2, where Li and L2 are the number of elements of Di and 
D2, respectively. By definition Mij = 1 if there is a link between vertices 
i of Di and j of D2 and Mij = if i and j arc not linked. It is useful to 
visualize M as a Li versus L2 lattice with empty (zero) or full (one) sites. 
Moreover, the number of links of a vertex / is ki and the distribution of links 
of Di and D2 is Pi-^ and Pi,^ respectively. 

In Ecology, the field data corresponding to the digraph is composed by 
two sets of species and the corresponding links (interactions) between them. 
As we pointed out in the introduction, the standard procedure in this area 
consists in packing the adjacency matrix of the data. The packing is per- 
formed in the following way: the link distributions Pi^ and Pi^ are ordered 
such that the most connected species go to the first position of the matrix. 
In this way the matrix M shows more ones close to z = 1 and j = 1 corner 
and zeros at the opposite corner, i — Li and j = L2. Prom the matrix point 
of view, the packing process consists in replacing lines and columns until 
Pi^ and P12 are ordered. We emphasize that since the packing process do 
not change the links between species it does not alter the phenomenology 
underlying the network. 

The idea behind packing the matrix M is to better visualize network 
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nestedness. In addition, nestedness is related with the dispersion of ones 
and zeros after the packing process. A very nested matrix is one that, after 
packing, has a minimal mixing of ones and zeros. Using a lattice analogy, a 
very nested lattice shows a minimum of holes. 

To introduce distance properties in the original matrix M we map it into 
a Cartesian space. In order to avoid distortions we map the Li x L2 matrix 
to the unit square. To perform this task the cell elements assume the 
positions Xj and Ui that are done by: 

x, = {i- + l/(2Li) 

y, = (j - + 1/(2L2) (1) 

In this article we use the Manhattan distance because it is broadly em- 
ployed to measure matrix distances. Euclidean distance is used for estimating 
distance between elements apart in continuum space, which is not the case 
here. In fact, in the context of abstract metric spaces (Courant, R. and 
Hilbert, D., 1937) set of distances = (x^ + y^y/^ that depends on the 
parameter x, the case x = 1 corresponds to the Manhattan distance and the 
case X = 2 to the Euchdean distance. 

We define the occupancy number p as the fraction of occupied sites in the 
adjacency matrix. For the total number of ones in M we have p = j^j^- 
To quantify nestedness of a given matrix M we use two matrix benchmarks 
with the same Li, L2 and p: the maximal nested matrix M and the random 
matrix Mrand- The maximal nested matrix is constructed in such a way that 
it has no holes and its elements are as close as possible to the (1, 1) corner. 
We construct M filling the elements along equidistant diagonals to (1, 1). In 
fact all elements along the same diagonal have the same distance to the (1, 1) 
corner. The construction of M is the following: the first element occupied is 
(1, 1), after that it comes (1, 2) and (2, 1), followed by (1, 3), (2, 2) and (3, 1), 
etc. Figure [1] illustrates the optimal filling strategy to build M. In contrast, 
the random matrix is constructed in such a way that all its elements are 
uniformly occupied with the same probability p = p- The maximum packed 
matrix of the species interaction will be in-between these two. 
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We use the Manhattan distance to evaluate the distances of the filled 
elements Xi, yj (the distances are defined for the matrix elements projected 
into the unit square in the cartesian plane). We call d the sum over the 
distances of all the elements of the matrix projected into the unit square, 
that means, d = J^dij for di j = Xi + yi. In order to define the nestedness 
estimator we introduce two additional distances: the total distance of the 
artificial matrices M and Mrand- We note that M has the smallest total 
distance among all the lattices with the same p and we call dmm its total 
distance, while the total distance of Mrand is drand- Consider a sample of 
points ramdomly distributed along the unit square, the expected value of the 
distance to the origin, fi, is the Manhatann distance from the origin to the 
center of the square of size 1, that means, fi = 1. Therefore: 

N 

drand = E ^M' = N fl = N (2) 
fc=l 

To get an insight about distances in M we start exploring the behavior 
of dmin and drand agaiust occupancy p in figure [2l We use in this picture 
Li = L2 = 20. As expected, the distances follow the relation dmin < drand- 
The total distance d for any matrix, after the packing process, shows the 
property: 

dmin ^ d drand- (3) 

In fact, dmin < d since dmin is derived from an artificial matrix whose compo- 
nents, by construction, have the minimal distance to the origin. Otherwise, 
d < drand because d is derived from a packed matrix, and in the packing 
process the matrix reduces the distances of their elements when compared 
with a similar random matrix. 

The distance as defined above depends on the matrix size and the occu- 
pation. In fact, the total distance observes the relation d oc L"^ for a given p 
and the relation d oc A^^ for a constant L. This behavior can be visualized 
in figures [2] and [31 In order to have a A^ free nestedness index of the system 
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we define the nestedness index rj as follows: 

= 1 

'-''rand '^min 

We emphasize that dmin and drand are computed over a artificial matrix with 
the same Li, L2 and p of the original system. In the next section we test 
7] over a set of digraphs from the context of community ecology and discuss 
the results. 



3 Results 

In this chapter we select a set of 23 insect-plant herbivory networks in the 
literature and apply the nestedness index we develop in this article. In table 
HJwe enumerate the set of networks with its main properties: the occupancy 
p, size Li and L2, the nestedness estimator rj, the temperature T (according 
to Atmar and Patterson, 1993) and the reference of the network. A visual 
inspection of the table does not reveal correlation between r] and T. In fact, 
a linear correlation analysis between the two variables revels no significant 
correlation [R = 0.19 and p = 0.18). 

The range of values of our estimator is 0.17 < rj < 0.83 and the average 
value is f] = 0.45. In contrast, the usual temperature estimator have the 
range 6.8 < T < 43 and average value p = 18.6. In order to improve the 
visual intuition about the problem we plot in figure 4 four lattices of insect- 
plant networks. It is clear in the figure that (a) and (b) are highly nested, 
and that on the contrary, (c) and (d) are not nested at all. This intuitive 
idea is corroborated by the r] estimator, but not by the temperature, the 
estimator of the two initial matrices are 0.18 and 0.23 (low values), and of 
the last two 0.79 and 0.76 (high values). The temperature estimator, on the 
other hand, shows an intermediary value in case (a), where our estimator 
shows a low value and a very high temperature in (c) where our estimator 
points for a very large value. In fact, figures (c) and (d) have a large number 
of specialists, and in consequence the matrix cannot be well nested. Our 
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estimator corroborates this observation. As a final remark concerning this 
set of figures, we point that all the matrices in this figure are well packed, 
that means, the number of elements of lines and columns are ordered. On 
the other hand, the nestedness calculator from Atmar and Patterson, 1993, 
usually fails in packing well the matrices. 

From the observation of figure H] we see that the variables are correlated, 
in fact, a linear correlation analysis reveals R = —0.74, t = 5.0 and p = 
5.9 X 10~^. An exponential correlation regression results R = —0.86, t = 6.3 
and p = 2.9 x 10~^. Therefore the adjust of the data to the exponential curve 
is slightly better than the linear one. The dependence of a nestedness index 
to occupancy was already pointed in the literature (Girons and Santamara, 
2006). At first, a relation between p and t] seems intuitive: once the number 
of sites increase, in the average, they will be more nested after the packing 
process. We let for a future work a more carefuU analysis of this point. 

4 Final Remarks 

In this work we develop a new nestedness estimator t] based on distances 
over the adjacency matrix of the network. We think that this estimator will 
be useful in the methodological discussion involving nestedness in commu- 
nity ecology. To make the method clearer to the reader we summarize the 
algorithm to find rj in the following sequence of steps: 

1. Evaluate the link distributions P^^ and Pl^ of the adjascency matrix 
of the network. 

2. Pack the matrix, that means, permute lines and columns of the matrix 
in order that P^^ and P^^ are ranked. This step defines a corner of 
nestedness. 

3. Project the matrix into the unit square in order to avoid distortion due 
to the diferences between the sizes Li and L2 of the matrix. 
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4. Find the manhatann distance d^^y of all elements Xi i/i of the matrix 
and sum to find the total distance of the elements of the matrix d. 

5. Determine analitically the distance of the associated random matrix 
with the same occupancy p: drand = for N the total number of 
occupied elements of the matrix. 

6. Determine computationally the distance of the asociated maximally 
nested matrix with the same p: dmin- 

7. Finally, calcule the estimator ri = d.-d^i„ — _ 

As estimated above, < r/ < 1. In the limit rj ^ the network is 
completely nested and t] ^ corresponds to the random limit. We tested 
our estimator for a set of 23 insect-plant networks and the data is summarized 
in table HI An interesting result of our estimator is that it depends on the 
occupancy number. This result is in agreement with the intuitive idea that 
the matrix nestedness increases with its occupancy density. 

The parameter T is a very used nestedness estimator in community ecol- 
ogy. This parameter, despite of its popularity, is not well defined and present 
several problems (Fischer and Lindenmayer, 2002; Rodrguez-Girons and San- 
tamara, 2006; Almeida-Neto et al., submitted). We are perfectly aware that 
our estimator will be compared with the usual temperature estimator devel- 
oped by Atmar and Patterson (1993). What we should do is to show the 
good points of our method and let the methodological discussion to the sci- 
entific community. In this way we stress the strong points of our method in 
the following: 

1. Our algorithm is based on plain geometry and metric statements. In 
this way it is simple and can be calculated with help of a easy computer 
program. 

2. We have two benchmarks clearly defined: the total distance of the 
random matrix and of the completely nested one. 
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3. We do not use any ad hoc parameter in the equation that defines the 
nestedness estimator. 

4. Our estimator gives a number between zero and one. 

5. The visual inspection criterion of nestedness of empirical matrices agrees 
with our estimator. 

This paper opens a new perspective in the study of nestedness. We de- 
velop an original index to measure nestedness that is based on direct metric 
analysis of the matrix. Instead of considering the dispersion of elements 
around an artificial isocline, we estimate directly the distances of all the 
matrix elements from the packing corner. The nestedness of a matrix is a 
measure of how much the elements of the matrix are close to the corner 
where the matrix is packed. We hope this paper to be usefull to improve the 
understanding of nestedness in the community ecology context. 
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Legend table 



Table 1: Set of 23 insect-plant community networks. For each network we 
show : the occupancy p, size Li and L2, the ncstedness estimator rj (multiplied 
by 100), the temperature T and the reference in the literature. 
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Figure captions 



Figure 1: Lattice representation of a particular lattice with Li = L2 = 5 and 
N= 8 (empty circles). The filled diamonds show the maximal nested lattice 
for the same Li, L2 and N. We remark that the circles represent the lattice 
after the packing process, both and Pl^ ranked. 

Figure 2: The behavior of dmin and drand versus p for Li = L2 = 20. The 
curves obey the rule dmin < drand except for p — >^ 1. 

Figure 3: The behavior of dmin and drand versus L for a constant occupation 
p — 0.5. For both curves we have d oc I?. 

Figure 4: Four examples of real community matrices referred to data files on 
table 1. In (a) the matrix corresponds to file 10, in (b) we use file 9, in (c) 
file 2 and in (d) file 12. 

Figure 5: The curve of occupancy p, versus nestedness index 7^, for all the 
network tested in this work. 
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File 


Fill(%) 


Size (Plant x Insect) 


T 


r] X 100 


Reference 


1 


42.9 


18 X 57 


28.0 


17.2 


Futuyama and Gould 1979 


2 


10.5 


10 X 18 


43.2 


79.1 


Dawah ct al. 1995 


3 


20.0 


43 X 14 


10.0 


24.3 


Buruga and Olembo 1971 


4 


8.4 


63 X 25 


6.9 


28.1 


Buruga and Olembo 1971 


5 


8.6 


33 X 55 


13.5 


40.8 


Claridge and Wilason 1981 


6 


1.1 


107 X 104 


7.29 


83.5 


Flowers and Janzen 1997 


7 


16.0 


52 X 22 


20.7 


35.1 


Joern 1979 


8 


13.3 


54 X 24 


11.0 


32.3 


Jocrn 1979 


9 


34.7 


11 X 11 


9.4 


23.0 


Piolou 1974 


10 


43.5 


13 X 12 


19.3 


18.1 


Pielou 1974 


11 


4.7 


52 X 27 


14.5 


62.7 


Jermy and Szentesi 2003 


12 


2.0 


53 X 92 


8.3 


75.9 


Mcnimot et al. 1994 


13 


5.5 


46 X 22 


20.1 


62.8 


Neck 1976 


14 


14.7 


08 X 11 


42.1 


68.1 


Pipkin ct al. 1966 


15 


4.4 


55 X 43 


8.3 


53.5 


Podlussany ct al. 2001 


16 


23.0 


13 X 09 


35.8 


42.3 


Ratchkc 1976 


17 


52.3 


15 X 08 


17.2 


18.2 


Sheldon and Rogers 1978 


18 


15.9 


18 X 15 


26.3 


45.3 


Prado and Lcwinsohn 1994 


19 


4.5 


33 X 29 


20.5 


69.7 


Valladarcs, Argentina (unpublished) 


20 


8.9 


27 X 22 


17.8 


53.7 


Lcwinsohn. Brazil (unpublished) 


21 


8.0 


30 X 34 


14.1 


44.1 


Lcwinsohn, Brazil (unpublished) 


22 


12.8 


33 X 21 


17.7 


37.9 


Almeida, Brazil (unpublished) 


23 


19.6 


21 X 32 


20.6 


29.3 


Kubota, Brazil (unpublished) 
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Figure 1 
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Figure 2 




22 




Figure 4a 
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Figure 4b 
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Figure 4d 
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